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Dynamic Content Based Assisted Information Browsing 



BACKGROUND OF THE INVENTION 



5 1. Field of the Invention 

The present invention relates to the field of computerized information retrieval 
and browsing. More specifically, the present invention relates to methods and 
apparatuses associated with dynamic content based assistance provided to 
facilitate information retrieval and browsing. 

10 

*' * 

|l 2. Background Information 

ill With the recent advances in microprocessor, telecommunication and 

H networking technology, increasing number of computer systems are being 

j£; networked together through private and public networks, such as the Internet. 

is"; ? * 

2 1 5 Volumes of information stored on information servers (such as web servers of the 

World Wide Web) are now available to users of computer systems with a few clicks 
of a mouse button. For many users, the ease of access has actually created an 
information overload situation. Early solution has been to categorize as much of the 
information available as possible, such as the directory or index services offered by 
20 Internet portals like Yahoo and Lycos. Even then, a simple search restricted to one 
categorization could still result in hundreds of potential hits, requiring a large amount 
of user time to sort out the useless from the useful. As a result, users are frustrated, 
and information technology are prevented from realizing its full potential in reaching 
the ubiquitous state. 

25 Various automated techniques in assisting a user in retrieving and browsing 

information have been proposed and/or experimented. In U.S. Patent 5,727,129, 
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entitled "Network system for profiling and actively facilitating user activities", issued 
to Barrett et al, a system and method for assisting a user in accessing information 
stored at remote network sites was disclosed. Under Barrett, an archive is 
maintained of remote sites accessed and instances in which the same remote sites 
are accessed in sequence. Statistics regarding information such as the number of 
times a site has been accessed, and the times a given set of sites have been 
accessed in sequence are maintained. Based on this information, information items 
are identified which the user is predicted to be likely to want to access. In U.S. 
Patent 5,960,429, entitled "Multiple reference hotlist for identifying frequently 
retrieved web pages", issued to Peercy et al, a method and apparatus for locating 
web pages was disclosed. Under Peercy, a count of retrievals of a web page is 
accumulated and the accumulated count and an address for the web page are 
stored in a record of a history log database. A multiple reference hotlist is formatted 
for the user from the records in the history log. Each of these prior art techniques 
has its pros and cons. Thus, additional techniques that can further improve the 
ease of information retrieval and browsing, even in limited circumstances, are 
desired. 
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SUMMARY OF THE INVENTION 

An automated method including automatic provision of additional display of a 
number of identifiers identifying information sources for assisting a user of the client 
5 system in retrieving and browsing information is disclosed. Under the method, in 
response to the retrieval and display on a display of a client system a first 
information page responsive to user direction, the additional display is provided. 
The information source identifiers are dynamically assembled, based at least in part 
on the content of said first information page. In one embodiment, a thumbnail 
10 corresponding to the information page identified by an information page identifier is 



also provided responsive to a user event. 
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BRIEF DESCRIPTION OF DRAWINGS 

The present invention will be described by way of exemplary embodiments, 
but not limitations, illustrated in the accompanying drawings in which like references 
denote similar elements, and in which: 

Figure 1 illustrates an overview of the present invention; 

Figure 2 illustrates the browser and the augmented dynamic content based 
assistance functions and databases of Fig. 1 in further detail in accordance with one 
embodiment; 

Figures 3a-3b illustrate one embodiment each of the related keyword 
database and related information source database of Fig. 2; 

Figure 4 illustrates a manner for designating a current table pair, in 
accordance with one embodiment; 

Figure 5 illustrates a method of the present invention in accordance with one 
embodiment; 

Figures 6a-Sd illustrate four network embodiments for practicing the present 
invention in accordance with four embodiments; and 

Figure 7 illustrates an example digital system suitable for practicing the 
present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 




In the following description, various aspects of the present invention will be 
described. However, it will be apparent to those skilled in the art that the present 
5 invention may be practiced with only some or all aspects of the present invention. 
For purposes of explanation, specific numbers, materials and configurations are set 
forth in order to provide a thorough understanding of the present invention. However, 
it will also be apparent to one skilled in the art that the present invention may be 
practiced without the specific details. In other instances, well known features are 

10 omitted or simplified in order not to obscure the present invention. 

Parts of the description will be presented using terms such as tables, keys, 
identifiers and so forth, commonly employed by those skilled in the art to convey the 
substance of their work to others skilled in the art. Parts of the description will be 
presented in terms of operations performed by a computer system, using terms such 

15 as parsing, accessing, retrieving, and so forth. As well understood by those skilled in 
the art, these quantities take the form of electrical, magnetic, or optical signals 
capable of being stored, transferred, combined, and otherwise manipulated through 
mechanical and electrical components of a digital system; and the term digital system 
include general purpose as well as special purpose data processing machines, 

20 systems, and the like, that are standalone, adjunct or embedded. 

Various operations will be described as multiple discrete steps performed in 
turn in a manner that is most helpful in understanding the present invention, however, 
the order of description should not be construed as to imply that these operations are 
necessarily order dependent, in particular, the order the operations are presented. 

25 Furthermore, the phrase "in one embodiment" will be used repeatedly, however the 
phrase does not necessarily refer to the same embodiment, although it may. 
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Referring now to Figure 1, wherein an overview of the present invention is 
shown. As illustrated, in accordance with the present invention, browser 100 is 
augmented with a number of functions 102 and databases 104 to provide dynamic 
5 content based assistance to a user to retrieve and browse information pages. More 
specifically, functions 102 and databases 104 enable browser 100 to facilitate 
augmented provision of a number of dynamically assembled other information 
source identifiers 106 based at least in part on dynamically determined contents of 
information page 108. (The display of information page 108 is illustrated as 

10 information page display 110.) Information page 108 is intended to represent a 
broad range of informational units known in the art, including but not limited to 
information "documents" formed using mark-up languages, such as HTML and XML. 
In one embodiment, functions 102 further enable browser 100 to provide thumbnails 
112 of information pages corresponding to provided information source identifiers 

15 108. A thumbnail of an information page, as described in more detail below, is a 
dithered down image of the information page. Thumbnails 112 are provided to 
enable the user to have a rough overview of what the information page is about. In 
one embodiment, a thumbnail is provided in response to a user event, such as when 
a user proximately placing a cursor "next" to an information source identifier. What 

20 constitutes "proximately placed" is application dependent. 

Figure 2 illustrates browser 100 as well as augmented functions 102 and 
databases 104 in further detail, in accordance with one embodiment. As illustrated, 
browser 100 includes conventional elements found in many browsers known in the 
25 art, HTTP interface 262, HTML web page handler 264, JAVA™ and JavaScript 
execution engine 266, other script interpreter 268 (e.g. CGI), display interface 270, 
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and a number of "plug-ins" (or add-ons), shown as additional Active-X components 
272-274. Included among these Active-X components 272-274 is a component 272 
that interfaces with selected ones of augmented functions 102, and a dithering 
module 274. Augmented functions 102 include lexical analyzer 282 and search 
5 engine 284, and augmented databases 104 include related keyword database 292 
and related information sources database 294. 

Lexical analyzer 282, for the illustrated embodiment, is used to determine in 
real time, unique nouns presence in the information page browsed. In alternate 
embodiments, lexical analyzer 282 may be used to determine presence of other 
10 types of words/terms presence in the information page, e.g. words/terms that are 
specially tagged. 

Search engine 284 is used to determine in real time, presence of certain 
keywords in the information page being browsed, for the illustrated embodiment, 
using unique nouns outputs by lexical analyzer 282. More specifically, search 

15 engine 284 uses the unique nouns to access related keyword database 292 to 

determine if any of the unique nouns are keywords of interest. Search engine 284 is 
additionally used to determine in real time, related keywords for the presence ones 
of the keywords of interest in the information page being browsed. For the 
illustrated embodiment, search engine 284 makes the determination by retrieving 

20 the related keywords stored in related keyword database 292 for a keyword of 

interest, when a unique noun matches the keyword of interest. Search engine 284 
is further used to determine in real time, the information source identifiers to be 
provided in view of the determined related keywords. For the illustrated 
embodiment, search engine 284 makes the determination by retrieving the 

25 information source identifiers from database 294 using the determined related 
keywords. 
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Lastly, as described earlier, add-on dithering module 274 is used to dither a 
retrieved one of the information pages identified by the augmented information 
source identifiers, to generate a thumbnail for the information page. In one 
embodiment, both the retrieval and dithering of the information page are performed 
5 when the thumbnail is needed. In another embodiment, the information page is pre- 
fetched, and the dithering is performed when the thumbnail is needed. In yet 
another embodiment, the information page is pre-fetched and the dithering is 
performed in advance before the thumbnail is needed. 

Except for the role they play to collectively facilitate practice of the present 

10 invention, in and of the themselves, each of the enumerated functions, interface 
272, dithering module 274, lexical analyzer 282, and searching engine 284 are 
generally known in the art, and may be implemented using any one of a number of 
techniques and programming languages known. Accordingly, they will not be 
otherwise further described. 

15 Figures 3a=3b illustrate one embodiment each of related keyword database 

292 and information source database 294. For the illustrated embodiment, related 
keyword database 292, as shown in Fog. 3a, is constituted with a number of tables 
302. Stored inside each table 302 are related keyword entries 304. Each related 
keyword entry 304 includes related keywords 308 for a keyword of interest 306. 

20 Similarly, for the illustrated embodiment, information source database 294, as 
shown in Fiig. 3b, is constituted with a number of tables 322. Stored inside each 
table 322 are information source entries 324. Each information source entry 324 
includes information source identifiers 328 for a related keyword 308. 

For the illustrated embodiment, keywords of interest 306 (and therefore, their 

25 related keywords 308 and associated information source identifiers 328) are 

organized by categories or subjects. For examples, a series of tables may contain 
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related keywords for various keywords of interest for Biology, Physics, Chemistry, 
History, Literature, and so forth; another series of tables may contain related 
keywords for various keywords of interest for Doctors, Dentists, Lawyers, 
Accountants, Financial Advisers, and so forth, and yet another series of tables may 
5 contain related keywords for various keywords of interest for Classical Music, Rock 
and Roll Music, Country Western Music, and so forth. There is no limitation to how 
the tables may be organized. 

For the illustrated embodiment, one of the corresponding table pairs 302 and 
322 is designated as the "current" table pair. During operation, search engine 284 

10 accesses only table 302 of the designated "current" table pair to determine whether 
certain keywords of interest are presence in an information page being browsed. 
The associated information source identifiers for each determined presence ones of 
keywords of interest are retrieved from the corresponding information source table 
322 of the designated "current" table pair. 

15 Fnguire 4 illustrates a manner in which the "current" table pair may be 

designated, in accordance with one embodiment. The manner illustrated is similar 
to the "tab" based pop-up approach in setting browser or software options in 
general. For the illustrated embodiment, option "pop up" 402 includes tab page 404 
for setting various content related options. Among the various settable content 

20 related options is field 406 for entering the name of the keyword of interest and 

information source table pair to be designated as the "current" table pair. Field 406 
has associated browse button 408 for displaying a list of table-pairs available for 
selection. The approach is basically known in the art, accordingly, will not be 
otherwise further described. 

25 The information contained in the various tables may be tabulated manually or 

using an automated process or a combination of both, by a vendor. The tabulation 
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may be repeated periodically to continually refine the keywords of interest, their 
relationship to other keywords, as well as the associated information sources. 

Figure 5 illustrates a method of the present invention, in accordance with one 
5 embodiment. As illustrated, at 502, unique nouns of an information page are 
identified. In the earlier described embodiment, the identification is performed by 
the lexical analyzer. At 504, the unique nouns are determined if they are to be 
considered as keywords of interest. In the earlier described embodiment, this is 
performed by the search engine accessing the related keyword database. At 506, 
10 the related keywords are determined for the presence ones of the keywords of 
interest. In the earlier described embodiment, this is also performed by the search 
y : l engine accessing the related keyword database. At 508, the associated information 

>V»4 to 

y sources, i.e. their identifiers, are identified for each of the retrieved related keywords 

for the presence ones of the keywords of interest. In the earlier described 

#» ■ * to 

15 embodiment, this is performed by the search engine accessing the information 
source database. 

At 510, the information pages corresponding to the retrieved information 
source identifiers are pre-fetched. In one embodiment, this is performed by the 
browser itself. At 512, the retrieved information pages are dithered to generate the 

20 thumbnails to have them available for use on demand. In the earlier described 
embodiment, this is performed by the add-on dither module. 

While for ease of understanding, the method has been described with the 
operations being performed one after the other, first the unique nouns are 
determined, then the keywords of interest are determined, as so forth. Those skilled 

25 in the art will appreciate that in alternate embodiments, the various operations may 
be performed in an overlapped or interleaving manner. That is, as soon as a unique 
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noun is determined, whether it is a keyword of interest may be determined, without 
awaiting all unique nouns to be determined. Likewise, once a keyword of interest is 
identified, its related keywords may be determined, without awaiting all keywords of 
interest to be identified. These principles apply equally to all the later operations. 



Figures 6a=6d illustrate four example network environments for practicing the 
present invention in accordance with four embodiments. The embodiment of Fig. 6a 
represents an embodiment, where all the relevant earlier described elements, i.e. 
add-on interface and dither modules 272 and 274, lexical analyzer 282, search 

10 engine 284, and databases 292 and 294 are all provided to a client system 602 
coupled to a network (e.g. the Internet). These relevant elements may be loaded 
onto client system 602 via a distribution medium (not shown) or downloaded from a 
distribution server (not shown). Note that a user may load/download only a subset 
of databases 292 and 294 tabulated by a vendor, or load/download them from 

1 5 multiple vendor sources. 

The embodiment of Fig. 6b represents an alternate embodiment, where 
except for information source database 294, all other relevant earlier described 
elements, i.e. add-on interface and dither modules 272 and 274, lexical analyzer 
282, search engine 284, and databases 292 are provided to a client system 602 

20 coupled to a network (e.g. the Internet). Information source database 294 is 
disposed on portal or service server 604 instead. During operation, upon 
determining the related keywords, search engine 284 would access database 294 
remotely to retrieve the associated information source identifiers. 

The embodiment of Fig. 6c represents yet another alternate embodiment, 

25 where only add-ons 272 and 274 and lexical analyzer 282 are provided to a client 
system 602 coupled to a network (e.g. the Internet). Otherwise, search engine 284, 
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and databases 292 and 294 are disposed on portal or service server 604 instead. 
During operation, upon determining the unique nouns, lexical analyzer 282 would 
provide the unique nouns to search engine 284 on serer 604 instead. Search 
engine 284, in turn determines the keywords of interest, their related keywords, and 
5 ultimately the associated information source identifiers, and provide them to client 
602, as described earlier. 

The embodiment of F5g. 6d represents yet another alternate embodiment, 
where all the relevant earlier described elements, i.e. interface and dither modules 
272 and 274, lexical analyzer 282, search engine 284, and databases 292 and 294 

10 are all disposed on one or more portal or service server 604 instead. During 
operation, browser 100 would keep interface 272 informed of the identity of the 
information page being browsed (e.g. in the case of a web page, providing interface 
272 with the web page's URL). Interface 272 causes server 604 to obtain its own 
copy of the information page. Therefore, lexical analyzer 282 and search engine, 

15 would all function (using databases 292 and 294) as described earlier, leading to the 
eventual provision of the information source identifiers to client 602. 



Figure 7 illustrates one embodiment of an exemplary digital system suitable 
for use to practice the present invention, either as a client system or a server 

20 system. As a client system, digital system 700 may be a desktop computer system, 
a laptop computer system, a palm sized computing device, a set-top box, an 
Internet appliance and the like. As a server, digital system 700 may a single or a 
cluster of computer systems. As shown, exemplary digital system 700 includes one 
or more processors 702 and system memory 704. Additionally, system 700 includes 

25 mass storage devices 706 (such as diskette, hard drive, CDROM and so forth), 
input/output devices 708 (such as keyboard, cursor control and so forth) and 
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communication interfaces 710 (such as network interface cards, modems and so 
forth). The elements are coupled to each other via system bus 712, which 
represents one or more buses. In the case of multiple buses, bridged by one or 
more bus bridges (not shown). Each of these elements perform its conventional 
functions known in the art. In particular, system memory 704 and mass storage 706 
are employed to store a working copy and a permanent copy of the programming 
instructions implementing the teachings of the present invention. The permanent 
copy of the programming instructions may be loaded into mass storage 706 in the 
factory, or in the field, as described earlier, through a distribution medium (not 
shown) or through communication interface 710 (from a distribution server (not 
shown). The constitution of these elements 702=712 are known, and accordingly will 
not be further described. 

Thus, a novel method and apparatus for dynamically assisting a user in 
information retrieval and browsing, based on the content of an information page has 
been described. While the present invention has been described in terms of the 
above illustrated embodiments, those skilled in the art will recognize that the 
invention is not limited to the embodiments described. The present invention can be 
practiced with modification and alteration within the spirit and scope of the appended 
claims. The description is thus to be regarded as illustrative instead of restrictive on 
the present invention. / 
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